Goto

Collaborating Authors

 dzone big data


Using SingleStoreDB, MindsDB, and Deepnote - DZone Big Data

#artificialintelligence

This article will show how to use SingleStoreDB with MindsDB using Deepnote. We'll create integrations within Deepnote, load the Iris flower data set into SingleStoreDB, and then use MindsDB to create a Machine Learning (ML) model from the Iris data stored in SingleStoreDB. We'll also make some example predictions using the ML model. Most of the code will be in SQL, enabling developers with solid SQL skills to hit the ground running and start working with ML immediately. The notebook file used in this article is available on GitHub.


Using Machine Learning to Detect Epilepsy in Children - DZone Big Data

#artificialintelligence

Artificial Intelligence has been making impressive strides in the past year or so, with a number of medical applications utilizing AI to spot problems in medical imagery more effectively and efficiently than current methods. For instance, we've had a couple of projects using the approach to better identify cancer, eye problems, and liver disease. A recent study has set out to do a similar feat to help researchers detect epilepsy in children. The research, which was a collaborative project between Young Epilepsy, UCL Great Ormond Street Institute of Child Health and the University of Cambridge, focused on Focal Cortical Dysplasia, which is a major cause of epilepsy in children. It describes the way the brain fails to form normally, and because the abnormalities tend to be small, they tend to be very difficult to pick up on MRI scans.


Understanding Core Data Science Algorithms: K-Means and K-Medoids Clustering - DZone Big Data

#artificialintelligence

Clustering is one of the major techniques used for statistical data analysis. As the term suggests, "clustering" is defined as the process of gathering similar objects into different groups or distribution of datasets into subsets with a defined distance measure. K-means clustering is touted as a foundational algorithm every data scientist ought to have in their toolbox. K-means and k-medoids are methods used in partitional clustering algorithms whose functionality works based on specifying an initial number of groups or, more precisely, iteratively by reallocation of objects among groups. The algorithm works by first segregating all the points into an already selected number of clusters.


Python Packages for Data Science - DZone Big Data

#artificialintelligence

Python is one of the most widely used programming languages. Although standard Python does not offer too much, its insane number of open-source and third-party libraries holding its popularity amongst the developers. You just name the domain and Python will provide you with its best packages and libraries. Data Science and Machine Learning are two demanding technologies of this era, and Python is doing better than excellent in these two fields. Apart from Python, R is another programming language that often used in Data Science projects. R is faster and contains more computational and statistical libraries; however, in this article, we have only covered the top Python Data Science Libraries which you should know if you want to master Data Science.


A Complete Guide To Math And Statistics For Data Science - DZone Big Data

#artificialintelligence

Math and Statistics for Data Science are essential because these disciples form the basic foundation of all the Machine Learning Algorithms. In fact, Mathematics is behind everything around us, from shapes, patterns, and colors, to the count of petals in a flower. Mathematics is embedded in each and every aspect of our lives. Although having a good understanding of programming languages, Machine Learning algorithms and following a data-driven approach is necessary to become a Data Scientist, Data Science isn't all about these fields. In this blog post, you will understand the importance of Math and Statistics for Data Science and how they can be used to build Machine Learning models.


10 Reasons to Learn Python in 2019 - DZone Big Data

#artificialintelligence

If you follow my blog regularly then you may be wondering why am I writing an article to tell people to learn Python? Didn't I ask you to prefer Java over Python a couple of years ago? Well, things have changed a lot since then. In 2016, Python replaced Java as the most popular language in colleges and universities and has never looked back. Python is growing big time.


Top Data Science Tools - DZone Big Data

#artificialintelligence

Whether you're building a house or a Fortune 500 corporation, having the right tools available is essential. Today's businesses are undertaking a different kind of building and require a different set of tools. As companies across the spectrum turn their attention to building a data analytics strategy to remain competitive in a digitally-focused global environment, they'll need an assortment of data science tools capable of slicing, dicing, and operationalizing enterprise data in myriad ways. A well-planned and executed data strategy will enable your business to make more business-critical decisions faster, and more accurately predict the outcomes of those decisions. But knowing which tools to use can be a real conundrum.


Why Use K-Means for Time Series Data? (Part Two) - DZone Big Data

#artificialintelligence

In "Why Use K-Means for Time Series Data? (Part One)," I give an overview of how to use different statistical functions and K-Means Clustering for anomaly detection for time series data. I recommend checking that out if you're unfamiliar with either. I am borrowing the code and dataset for this portion from Amid Fish's tutorial. Please take a look at it, it's pretty awesome. In this example, I will show you how you can detect anomalies in EKG data via contextual anomaly detection with K-Means Clustering.


Big Data and Artificial Intelligence - DZone Big Data

#artificialintelligence

Recently, I gave a two hour lecture on "Big Data and Artificial Intelligence," to use some buzzwords, as asked. More honestly, it will be on (new) data and (new) algorithms for predictive modeling.


The Skills That Data Analysts Need to Master - DZone Big Data

#artificialintelligence

This seems very simple, but, in fact, it's not. Excel can not only do simple two-dimensional tables, complex nested tables, but also create line charts, column charts, bar charts, area charts, pie charts, radar charts, combo charts, and scatter charts. Although you are a business analyst, if you can rely on IT and IT tools (such as a multi-dimensional BI analysis model) sometimes you can't get the data you want. These skills will definitely attract the attention of senior leaders, as it allows them to understand at a glance, and gain insight into, the essence of the business. Summary: At this point, if you've mastered 80% of the above skills you can be considered a qualified analyst.